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Abstract 

We investigate a model-theoretic property that generalizes the classical notion of 
"preservation under substructures". We call this property preservation under sub- 
structures modulo bounded cores, and present a syntactic characterization via E c 2 ' 
sentences for properties of arbitrary structures definable by FO sentences. As a 
shaiper characterization, we further show that the count of existential quantifiers 
in the E° sentence equals the size of the smallest bounded core. We also present 
our results on the sharper characterization for special fragments of FO and also 
over special classes of structures. We present a (not FO-definable) class of finite 
structures for which the sharper characterization fails, but for which the classi- 
cal Los-Tarski preservation theorem holds. As a fallout of our studies, we obtain 
combinatorial proofs of the Los-Tarski theorem for some of the aforementioned 
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1 Introduction 

Preservation theorems have traditionally been an important area of study in model the- 
ory. These theorems provide syntactic characterizations of semantic properties that 
are preserved under model-theoretic operations. One of the earliest preservation the- 
orems is the Los-Tarski theorem, which states that over arbitrary structures, a First 
Order (FO) sentence is preserved under taking substructures iff it is equivalent to a 
II" sentence [5]. Subsequently many other preservation theorems were studied, e.g. 
preservation under unions of chains, homomorphisms, direct products, etc. With the 
advent of finite model theory, the question of whether these theorems hold over finite 
structures became interesting. It turned out that several preservation theorems fail in 
the finite [1, 7, 9]. This inspired research on preservation theorems over special classes 
of finite structures, e.g. those with bounded degree, bounded treewidth etc. These ef- 
forts eventually led to some preservation theorems being "recovered" [2, 3]. Among 
the theorems whose status over the class of all finite structures was open for long was 
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the homomorphism preservation theorem. This was recently resolved in [10], which 
showed that the theorem survives in the finite. 

In this paper, we look at a generalization of the 'preservation under substructures' prop- 
erty that we call preservation under substructures modulo bounded cores. In Section 2, 
we show that for FO sentences, this property has a syntactic characterization in terms 
of E° sentences over arbitrary structures. As a sharper characterization, we state our 
result (but provide the proof later in Section 7) that for core sizes bounded by a number 
B, there is a syntactic characterization in terms of sentences that use atmost B exis- 
tential quantifiers. In Section 3, we discuss how the notion of relativization can be used 
to prove the sharper characterization in special cases and also discuss its limitations. 
We present our studies for special classes of FO and over special classes of structures 
in Sections 4 and 5. As a fallout of our studies, we obtain combinatorial proofs of the 
classical Los-Tarski theorem for some of the aforesaid special cases, and also obtain 
semantic characterizations of natural subclasses of the A° fragment of FO. In Section 
7, we provide the proof of the sharper characterization using tools from classical model 
theory and some notions that we define. We conclude with questions for future work 
in Section 8. 

We assume that the reader is familiar with standard notation and terminology used in 
the syntax and semantics of FO (see [8]). A vocabulary t is a set of predicate, function 
and constant symbols. In this paper, we will restrict ourselves to finite vocabularies 
only. A relational vocabulary has only predicate and constant symbols, and a purely 
relational vocabulary has only predicate symbols. We denote by FO(t), the set of all 
FO formulae over vocabulary r. A sequence (xi, . . . , Xk) of variables is denoted by 
x. We will abbreviate a block of quantifiers of the form Qx\ . . . Qxk by Qx, where 
Q G {V, 3}. By S° (resp. 11°), we mean FO sentences in Prenex Normal Form 
(PNF) over an arbitrary vocabulary, whose quantifier prefix begins with a 3 (resp. V) 
and consists of k — 1 alternations of quantifiers. We use the standard notions of r- 
structures, substructures and extensions, as in [8]. Given r— structures M and N, we 
denote by M C N that M is a substructure of N (or N is an extension of M), Given 
M and a subset S (resp. a tuple a of elements) of its universe, we denote by M(S) 
(resp. M(a)) the smallest substructure (under set inclusion ordering of the universe) of 
M containing S (resp. underlying set of a) and call it the substructure of M induced 
by S (resp. underlying set of a). Finally, by size of M, we mean the cardinality of 
its universe and denote it by \M\. As a final note of convention, whenever we talk 
of FO definability in the paper, we mean definability via FO sentences (as opposed to 
theories), unless stated otherwise. 

2 Preservation under substructures modulo cores 

We denote by PS the collection of all classes of structures, in any vocabulary, which 
are closed under taking substructures. This includes classes which are not definable in 
any logic. We let PS denote the collection of FO definable classes in PS. We identify 
classes in PS with their defining FO sentences and will henceforth treat PS as a set of 
sentences. We now consider a natural generalization of the PS property. Our discussion 
will concern arbitrary (finite) vocabularies and arbitrary structures over them. 
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2.1 The case of finite cores 

Definition 1 (Preservation under substructures modulo finite cores) 
A class of structures S is said to be preserved under substructures modulo a finite core 
( denoted S G PSC / ), if for every structure M G S, there exists a finite subset C of 
elements of M such that if Mi C M and M\ contains C, then M\ G S. The set C is 
called a core of M w.r.t. S. If S is clear from context, we will call C as a core of M. 

Note that any finite subset of the universe of M containing a core is also a core of M. 
Also, there can be multiple cores of M having the same size. A minimal core of M is 
a core, no subset of which is a core of M. 

We will use PSC / to denote the collection of all classes preserved under substructures 
modulo a finite core. Similarly, we will use PSCf to denote the collection of FO defin- 
able classes in PSC/. We identify classes in PSCf with their defining FO sentences, 
and will henceforth treat PSCf as a set of sentences. 

Example 1: Let S be the class of all graphs containing cycles. For any graph in S, the 
vertices of any cycle is a core of the graph. Thus S G PSC/. 

Note that P§ C PSC / since for any class in PS and for any structure in the class, any 
element is a core. However it is easy to check that S in above example is not in PS; so 
PSC / strictly generalizes PS. Further, the FO inexpressibility of S shows that PSC / 
contains classes not definable in FO. 

Example 2: Consider <fi = 3xVyE(x, y). In any graph satisfying <j), any witness for x 
is a core of the graph. Thus G PSCf. In fact, one can put a uniform bound of 1 on 
the minimal core size for all models of <j>. 

Again it is easy to see that PS C PSCf. Specifically, the sentence <f> in Example 2 is 
not in PS. This is because a directed graph with exactly two nodes a and b, and having 
all directed edges except the self loop on a models but the subgraph induced by a 
does not model <fi. Hence PS C PSCf. Extending the example above, one can show 
that for any sentence <p in E°, in any model of ip, any witness for the 3 quantifiers in 
ip forms a core of the model. Hence £!] £ PSCf. In fact, for any sentence in E°, 
the number of 3 quantifiers serves as a uniform bound on the minimal core size for 
all models. Surprisingly, even for an arbitrary (j> G PSCf, it is possible to bound the 
minimal core size for all models! 

Towards the result, we use the notions of chain and union of chain from the literature. 
The reader is referred to [5] for the definitions. We denote a chain as Mi C M-i C . . . 
and its union as [_J i>0 M*. We say that a sentence (j> is preserved under unions of chains 
if for every chain of models of <\>, the union of the chain is also a model of <fi. We now 
recall the following characterization theorem from the '60s [5]. 

Theorem 1 ( Chang-Los-Suszko) A sentence <j) is preserved under unions of chains iff 
it is equivalent to a n° sentence. 

Now we have the following theorem. 

Theorem 2 A sentence <f> G PSCf iff (j) is equivalent to a S§ sentence. 
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Proof: We infer from Theorem 1 the following equivalences. 

is equivalent to a Eq sentence iff 
-i(f> is equivalent to a ITq sentence iff 

VMl M a , . . . ((Mi C M 2 C . . .) A (M = U 4 >i M) A V*(M (= ^0)) -+ M |= -0 
iff 

VMi, M 2 , . . . ((Mi C M 2 C . . .) A (M - U,>i ^) A (M |= 0)) ->. 3i(M< (= 0) 

Assume G PSCf. Suppose Mi C M2 C ... is a chain, M = U;>oM and 
M \= <f). Then, there exists a finite core C of M. For any a G C, there exists an 
ordinal i a s.t. a G Mi a (else a would not be in the union M). Since C is finite, let 

1 = max(i a | a G C). Since i a < i, we have Mj a C Mjj hence a G Mj for all a G C. 
Thus Mj contains C. Since C is a core of M and Mj C M, Mj (= by definition of 
PSCf. By the equivalences shown above, is equivalent to a £ 2 sentence. We have 
seen earlier that Yfi, C PSCf. ■ 

Corollary 1 6 PSCf, there exists B G N 5mc/i f/iaf every model of <fi has a core 
of size atmost B. 

Proof: Take B to be the number of 3 quantifiers in the equivalent S 2 sentence. ■ 

Given Corollary 1, it is natural to ask if B is computable. In this context, the following 
recent unpublished result by Rossman [11] is relevant. Let \<f>\ denote the size of <f>. 

Theorem 3 (Rossman) There is no recursive function / : N — > N such that iftpE PS, 
then there is an equivalent TL® sentence of size atmost f(\<f>\). The result holds even for 
relational vocabularies and further even if PS is replaced with PS D S 2 . 

Corollary 2 There is no recursive function f : N — > N such that if <f> G PS, then there 
is an equivalent sentence with atmost f(\(j>\) universal variables. The result holds 
even for relational vocabularies and further even if PS is replaced with PS PI S 2 . 

Proof: Let ip — Y l zip(z) be a IlJ sentence equivalent to cf> where n — f(\4>\). Let 
k be the number of atomic formulae in i/j. Since </> and ip have the same vocabulary, 
k G 0(|c/>| -nW). The size of the Disjunctive Normal Form of ip is therefore bounded 
above by 0(k ■ n ■ 2 k ). Hence \<p\ is a recursive function of \<j)\ if / is recursive. ■ 

Theorem 3 strengthens the non-elementary lower bound given in [6] . Corollary 2 gives 
us the following. 

Lemma 1 There is no recursive function f : N — > N s.t. if <p G PSCf, then every 
model of <p has a core of size atmost f{\4>\). 

Proof: Consider such a function /. For any sentence in a relational vocabulary 
t s.t. <p G PS, ^<t> is equivalent to a sentence by Los-Tarski theorem. Hence 
-|0 G PSCf. By assumption about /, the size of minimal models of -i<f> is bounded 
above by n = /(|0|) + k, where k is the number of constants in r. Therefore, ^0 
is equivalent to an 3 n sentence and hence is equivalent to a V" sentence. Corollary 
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2 now forbids n, and hence /, from being recursive. It is easy to see that the result 
extends to vocabularies with functions too (by using functions in a trivial way). ■ 

Corollary 1 motivates us to consider sentences with bounded cores since all sentences 
in PSCf have bounded cores. 

2.2 The case of bounded cores 

We first give a more general definition. 

Definition 2 (Preservation under substructures modulo a bounded core) A class of 
structures S is said to be preserved under substructures modulo a bounded core (de- 
noted S G PSC), if S G PSC/ and there exists a finite cardinal B dependent only on 
S such that every structure in S has a core of size atmost B. 

The collection of all such classes is denoted by PSC. Let PSC(£?) be the sub-collection 
of PSC in which each class has minimal core sizes bounded by B. Then PSC = 
U B>0 PSC(B). An easy observation is that PSC(i) C PSC(j) for i < j. As before, 
PSC and each PSC(£>) contain non-FO definable classes. As an example, the class of 
forests is in PSC(O). Let PSC (resp. PSC(B)) be the FO definable classes in PSC 
(resp. PSC(B)). Observe that PSC(0) is exactly PS and PSC = \Jb>q PSC(B). 
Therefore, PSC generalizes PS. Further, the hierarchy in PSC is strict. Consider 
<j> G PSC(k) given by cf> = 3x t . . . 3x k Ai<i<i<fe ^ = x o)- Then t PSC(l) 
for I < k. From Corollary 1, we have 

Lemma 2 PSC = PSC f . 

As noted earlier, a sentence (f> with B existential quantifiers is in PSCf with mini- 
mal core size bounded by B. Hence </> G PSC(B). In the converse direction, Theorem 

2 and Lemma 2 together imply that for a sentence (j> G PSC(B), there is an equivalent 
S° sentence. We can then ask the following sharper question: For <fi G PSC(B), is 
there an equivalent E° sentence having B existential quantifiers? 

Theorem 4 A sentence <fi G PSC(B) iff it is equivalent to a sentence with B 
existential quantifiers. 

The proof of this theorem uses tools from classical model theory and some notions that 
we define. We will present it in Section 7. Before that we shall consider Theorem 4 
for special fragments of FO and for special classes of structures. Towards this, we first 
look at the notion of relativization from the literature. 

3 Revisiting Relativization 

For purposes of our discussion in this and remaining sections of the paper, we will 
assume relational vocabularies (predicates and constants). 

A notion that has proved immensely helpful in proving most of our positive special 
cases of Theorem 4 is that of relativization. Informally speaking, given a sentence <f>, 
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we would like to define a formula (with free variables x) which asserts that <f> is true 
in the submodel induced by x. The following lemma shows the existence of such a 
formula. 

Lemma 3 If t is a relational vocabulary, for every FO(t) sentence <p and variables 
x = (xi, . . . , Xk), there exists a quantifier- free formula <p\ x with free variables x such 
that the following holds: Let M be a model and a = (cti, . . . , dfc) be a sequence of 
elements of M. Then 

(M,a u ...,a k ) (=0| a #M({oi J ...,o fc }) \=4> 

Proof: Let X = {xi, . . . ,Xk} and C be the set of constants in r. First replace 
every V quantifier in <j> by -3, Then replace every subformula of <fi of the form 
3x\(x, yi, . . . ,y k ) by V 2S xuc *( z > yi, ■ ■ ■ , yu)- ■ 

We refer to <fi\ x as '</> relativized to x'. We shall sometimes denote <j>\ x as <fi\{ Xl ,... : x k } 
(though a; is a sequence and {xi, . . . , Xh} is a set). 

We refer to 0| 2 as '</> relativized to x' . For clarity of exposition, we will abuse notation 
and use 4 , \{ Xl ,....x k } to denote <fr\ x (although x is a sequence and {xi, . . . , Xk} is a set), 
whenever convenient. 
We begin with the following observation. 

Lemma 4 Over any given class C of structures in PS, if <j) O Vzi . . . "iz n ip where if is 
quantifier-free, then <j) O where ip — Vzi . . . Vz n 0|{ Zl 

Proof: It is easy to see that — > ip. Let M £ C be s.t. M |= ip. Let a be an 
n-tuple from M. Then, by Lemma 3, M (a) \= (j>. Since C e PS, A/ (a) 6 C 
so that M(a) |= Vzi . . .\/z n ip. Then Af(a) |= tp(a) and hence A/ |= </?(a). Then 
A/ |= Vzi . . . Vz„(/3 and hence Af ^ 0. ■ 

Using Los-Tarski theorem and the above lemma, it follows that a sentence cj> in PS 
has an equivalent universal sentence whose matrix is <fi itself relativized to the univer- 
sal variables. However we give a proof of this latter fact directly using relativization, 
and hence an alternate proof of the Los-Tarski theorem. We emphasize that our proof 
works only for relational vocabularies (Los-Tarski is known to hold for arbitrary vo- 
cabularies). This would show that relativization helps us prove Theorem 4 for the case 
of B = 0. 

3.1 A proof of Los-Tarski theorem using relativization 

We first introduce some notation. Given a t— structure A/, we denote by tm, the 
vocabulary obtained by expanding r with as many constant symbols as the elements of 
M - one constant per element. We denote by A4 the tm structure whose t— reduct is 
M and in which each constant in tm is interpreted as the element of M corresponding 
to the constant. It is clear that M uniquely determines A4. Finally, T>(M) denotes the 
diagram of Af - the collection of quantifier free tjv/— sentences true in M.. 
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Theorem 5 (Los-Tarski) A FO sentence </> is in PS iff there exists an n G N such that 
<j) is equivalent to Vzi . . . \/z n (f>\{ Zl ,... jZ?l }- 

Proo/ - : 

Consider a set of sentences V = \ k G N, = Vzi . . . Vzfc0|{ 2l! .... 2fc }}. Observe 
that — !> so that a finite collection of £fcS will be equivalent to where fc* is 
the highest index fc appearing in the collection. We will show that <j> O T. Once we 
show this, by compactness theorem, (jloTi for some finite subset Ti of T and by the 
preceding observation, (f> is equivalent to € Y\ for some n. 

If M \= then since G PS, every substructure of it models <f> - in particular, the 
substructure induced by any fc-elements of AI. Then AI \= for every k and hence 

M \= r. 

Conversely, suppose M |= T. Then every finite substructure of M models (j>. Let Ai 
be the Tm structure corresponding to M. Consider any finite subset S of the diagram 
V(M) of M. Let C be the finite set of constants referred to in S. Clearly Ai\ T uc^ 
namely the (r U C)-reduct of Ai models S since Ai \= 1>(M). Then consider the 
substructure Aii of Ai \ T uC induced by the intepretations of the constants of C - this 
satisfies S. Now since C is finite, so is Adi. Then the r— reduct of M.\ - a finite 
substructure of M models </>. 

Thus S U {</>} is satisfiable by Mi. Since 5 was arbitrary, every finite subset of 
V(M) U {<fi} is satisfiable so that by compactness, T>(M) U {</>} is satisfiable by some 
structure say Af. Then the r— reduct N of TV is s.t. (i) AI is embeddable in N and (ii) 
N \= 4>. Since G -P5, the embedding of AI in N models <fr and hence AI \= (f>. ■ 

The above proof shows that for <p G PS, there is an equivalent universal sentence 
whose matrix is 4> itself, relativised to the universal variables. In fact, by Lemma 4, 
there is an optimal (in terms of the number of universal variables) such sentence. 
An observation from the proof of Theorem 5 is that, the Los-Tarski theorem is true 
over any class of structures satisfying compactness - hence in particular the class of 
structures definable by a FO theory (indeed this result is known). But there are classes 
of structures which are not definable by FO theories but still satisfy compactness: Con- 
sider any FO theory having infinite models and consider the class of models of this 
theory whose cardinality is not equal to a given infinite cardinal. This class satisfies 
compactness but cannot be definable by any FO theory due to Lowenheim-Skolem 
theorem. Yet Los-Tarski theorem would hold over this class. 

Having seen the usefulness of relativization in proving Theorem 4 when B equals 0, 
it is natural to ask if this technique works for higher values of B too. We answer this 
negatively. 

3.2 Limitations of relativization 

We show by a concrete example that relativization cannot be used to prove Theorem 
4 in general. This motivates us to derive necessary and sufficient conditions for rela- 
tivization to work. 

Example 3: Consider <fi = 3x\lyE{x,y) over r = {E}. Note that cf> is in PSC(1). 
Suppose <fi is equivalent to if) = 3x\/ n y(f>\ X y for some n. Consider the structure 
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M = (Z, <) namely the integers with usual < linear order. Any finite substructure 
of M satisfies <fi since it has a minimum element (under the linear order). Then taking 
x to be any integer, we see that M \= ip. However M ^ <fi since M has no minimum 
element - a contradiction. The same argument can be used to show that <fi cannot be 
equivalent to any sentence of the form 3 n x V m y <p\sy- 

We now give necessary and sufficient conditions for relativization to work. Towards 
this, we introduce the following notion. Consider <fi G FO(t) s.t. <fi G PSC(B). 
Consider a vocabulary tb obtained by expanding r with B fresh constants. Consider 
the class of r^-structures with the following properties: 

1. For each (M, oi, . . . , «b) G S^ 1 where M is a t— structure and oi, . . . , a B G 
M, M |= <p and {oi, . . . , as} forms a core of M w.r.t. <fr. 

2. For each model M of <p>, for each core C = {a±, . . . ,ai} of M w.r.t. </> s.t. 
I < B and for each function p : {1, . . . , B} — > C with range C, it must be that 
(M,p(l),...,p(B))eSf. 

We now have the following. 

Theorem 6 Given <fi G PSC(B), the following are equivalent. 

1. S^' is finitely axiomatizable. 

2. (j> is equivalent to 3 B x V™y (j>\xy for some n G N. 

3. (f> is equivalent to a 3 B V* sentence ip such that in any model M ofip and cf>, the 
following hold: 

(a) The underlying set of any witness for ip is a core of M w.r.t. (f>. 

(b) Conversely, if C is a core of M w.r.t. <j), x±, . . . ,xb are the 3 variables 
of ip and f : {xi, . . . ,xb} ~ > C is any function with range C, then 
(f(xi), . . . , f{xs)) is witness for ip in M. 

Proof: 

(1) -> (2): Let Sf be finitely axiomatizable. Check that Sf G PS so that by Los- 
Tarski theorem, it is axiomatizable by a FO(rB)-sentence ip having say n V quan- 
tifiers. Further, by Lemma 4, ip is equivalent to 7 = V™zip\z. Now consider cp = 
3 B x V n y (p\xy- Firstly, from Lemma 5, <p ~ > <P- Conversely, suppose M |= (p. Let 
01, . . . , 0,3 be witnesses and consider the T^-structure Mb = (M, a-y, . . . , as). Now 
Mb h 4>\xy- We will show that M B |= 7. Consider &i, . . . , b n G M and let 
Mi = M B ({bi, • ■ • , b n }). Then M t \= V"y (p\ m . Check that the r-reduct of M t 

(1) models <p and (ii) contains {ai, . . . , as} as a core. Then Mi G S^ 1 and hence 
Mi |= ip. Since b\,...,b n were arbitrary, M b (= 7- Since 7 -H- tp and ip axiomatizes 
S a ^, the t— reduct of Mb, namely M , models <p. 

(2) — > (3): Take ?/' to be 3 B x \/ n y 4>\xy- Consider a model M of <p> and ip. The set C of 
elements of any witness for if,' forms a core of M w.r.t. ?/'■ Then since <p ip,C is also 
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a core of M w.r.t. 0. Conversely, consider a core C of M w.r.t. 0. Then any substruc- 
ture of M containing C satisfies 0. Then check that elements of C form a witness for tp. 

(3) — > (1): Let -H> -0 where -0 = 3 B x V n yf3(x, y) where (3 is quantifier free and tp 
satisfies the conditions mentioned in (3). Consider ip = V n y (3[xi i— > ci, . . . ,xb i-> 
cb] where ci, . . . ,Cb are _B fresh constants and Xi i-> cj means replacement of 
by If Ms = (M, di, . . . , Ob) |= tp, then M (= -0 and hence M (= 0. Since 
ai, . . . , as are witnesses for tp in A/, they form a core of M w.r.t. by assumption, 
so that M B £ Sf. Conversely, if Mb = (M, oi, . . . , a B ) £ S^ 11 , then M |= and 
ai, . . . , a B form a core in M. Then by assumption, M (= ^ and ai, . . . , <zb are wit- 
nesses for ?/'■ Then Mb |= tp. To sum up, y> axiomatizes S^. ■ 

Consider and M in the Example 3 above. Take any finite substructure Mi of M - it 
models 0. There is exactly one witness for in Ml, namely the least element under <. 
However every element in Mi serves as a core. The above theorem shows that no 3V* 
sentence will be able to capture exactly all the cores through its 3 variable. 
In the following sections, we shall study Theorem 4 for several special classes of FO 
and over special structures. Interestingly, in most of the cases in which Theorem 4 turns 
out true, relativization works! However we also show a case in which relativization 
does not work, yet Theorem 4 is true. 

4 Positive Special Cases for Theorem 4 
4.1 Theorem 4 holds for special fragments of FO 

Unless otherwise stated, we consider relational vocabularies throughout the section. 
The following lemma will be repeatedly used in the subsequent results. 

Lemma 5 Let <G PSC(B). For every n £ N, implies 3 B x V n y 4>\xy- 

Proof: Suppose M |= 0. Since £ PSC(B), there is a core C of M of size at most 
B. Interpret x to include all the elements of C (in any which way). Since C is a core, 
for any n-tuple d of elements of M, having underlying set D, the substructure of M 
induced by C U D models 0. Then (M, a, d) \= 4>\xy for all d from M. ■ 

Lemma 6 Lef r be a monadic vocabulary containing k unary predicates. Let S 
FO(t) be a sentence of rank r s.t. £ PSC(B). Then is equivalent to tp where 
tp = 3 B x y n y <fi\xy where n = r x 2 fc . For B = 0, n is optimal i.e. there is an FO 
sentence in PSC(0) for which any equivalent n° sentence has atleast n quantifiers. 

Proof : That implies tp follows from Lemma 5. For the converse, suppose M \= tp 
where n — r x 2 fe . By an Ehrenfeucht-Fraisse game argument, we can show that M 
contains a substructure Ms such that (i) M = r Ms, with |Ms| < n and (ii) for any 
extension M' of Ms in M, M' = r Ms- The substructure Ms is obtained by taking up 
to r elements of each colour c £ 2 r present in M. An element a in structure M is said 
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to have colour c if for every predicate P E S, M \= P(a) iff P G c. Since M \= ip, 
there exists witnesses a for ^> in M. Choose b to be an n-tuple which includes the 
elements of Mg. This is possible because \Ms\ < n. Then we have, (M, a, b) |= cp\ S y 
so that M(ab) (= 0. But M s C M(afc) C M so that M(ab) = r M. Then M \= (p. 

To see the optimality of n for B = 0, consider the sentence which states that there 
exists at least one colour c G 2 r such that there exist at most r — 1 elements with 
colour c. The sentence (p can be written as a formula with rank r, as the disjunction 
over all colours, of sentences of the form, 3xi3a;2 ■ • ■ 3x r _iVa; r (/\[~ 1 1 x r ^ x{) — > 
~^C(x r ). From the preceding paragraph, (p o V"y (p\y where n = r x 2 fe . Suppose 
is equivalent to a V s sentence for some s < n. Then by Lemma 4, <f> <{> where 
(f = y s y 4>\y. Then consider the structure M, which has r elements of each colour. 
Clearly, M ^= </>. However check that every s-sized substructure of M models <fi. Then 
M \= tp and hence M \= <f> - a contradiction. ■ 

Lemma 7 Let S G PSC(-B) be a finite collection of t— structures so that S is definable 
by a £2 sentence (j) G PSC(B). Then S is definable by the sentence ip where ip = 
3 B x \/ n y 4>\xyfor some n G N. 

Proof : Check that all structures in S must be of finite size so that <f> exists. Let the size 
of the largest structure in S be atmost n. Consider ip. Lemma 5 shows that <fi — > tjj. 
Conversely, suppose M \= ip. Then there exists a witness a s.t. any extension of M(a) 
within M with atmost n additional elements models <p. Since M is of size atmost n, 
taking the extension M of M(a), we have M \= (p. Since <p defines S so does ip. ■ 

Lemma 8 Consider <p G Ilj given by (p = V™5 3 m y y) where ft is quantifier free. 
If (p G PSC(B), then (p is equivalent to ip where ip = 3 B u \/ n v <p\uv- 

Proof: From Lemma 5, <p — > ip. For the converse, let M \= ip and let a be a witness. 
Consider an n— tuple b from M. Then M\ = M(ab) is s.t. Mi |= (p. Then for x = 6, 
there exists y = d s.t. d is an m— tuple from Mi and A/i |= /3(fe, cQ. Then M |= f3(b, d) 
since Mi C M. Hence M |= <p. ■ 

Lemma 9 Suppose <p G PSC(B) and -*p G PSC(B'). Then <p is equivalent to ip 
where ip = 3 B x V s y <^|sy- 

Proof: From Lemma 5, </> implies if,'. For the converse, suppose M \= ip. Then there 
is a witness a for ip s.t. for any S'-tuple 6, the substructure induced by ab i.e. M(ab) 
models <p. Suppose M ^= 0. Then M ^= -1^ so that there is a core C of M w.r.t. -i</>, 
of size at most Let d be a i?'-tuple which includes all the elements of C. Then 
M (ad) f= 0. But M(ad) C M contains C so that M(ad) (= ->0 - a contradiction. ■ 

Observe that for the special case of _B = 0, we get combinatorial proofs of Los-Tarski 
theorem for the fragments mentioned above. Moreover all of these proofs and hence 
the results hold in the finite. We mention that the result of Lemma 8 holding in the 
finite was proved by Compton too (see [7]). We were unware of this until recently and 
have independently arrived at the same result. The reader is referred to Section 6 for 
our studies on more positive cases of Los-Tarski in the finite. 
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Interestingly, Lemma 9 has implications for the A° fragment of FO. Define A° (k, I) C 
A° to be the class of sentences which have a 3 fe V* and a V'3* equivalent. Note that 
A° = \J l fe>0 A^fc, Z). Lemma 9 gives us the following right away. 

Theorem 7 The following are equivalent: 
!.(/)£ PSC{k) and -.0 G PSC^). 

2. is equivalent to a 3 k \/ 1 and a \/ l 3 k sentence. 

3. 0G A°(fe,Z). 

As a corollary, we see that A!](fc, I) is a finite class upto equivalence. We are not 
aware of any other semantic characterization of these natural fragments of A§. This 
highlights the importance of the notion of cores and the sizes thereof. 

4.2 Theorem 4 over special classes of structures 

We first look at Theorem 4 over finite words which are finite structures in the vocab- 
ulary containing one binary predicate < (always interpreted as a linear order) and a 
finite number of unary predicates (which form a partition of the universe). And we 
obtain something stronger than Theorem 4. Before that, we mention that the idea of 
relativization can be naturally extended to MSO. Given <fi in MSO and a set of variables 
Z = {zi, . . . , z n }, (f)\z is obtained by first converting all VX to ->3X and then replac- 
ing every subformula 3X X (X, . . .) with Vycz((A zG y x ( z ) A A z£ z\y ^ x ( z )) a 
X(X, . . .)). The resulting FO formula is then relativized to Z and simplified to elim- 
inate the (original) SO variables. As before, abusing notation, we use <fi\z and </>| 2 
interchangeably. 

Note: We at times will refer to the 'structure' connotation of a word and at other times 
refer to the 'string' connotation of it. This would however be clear from the context 
(typically language-theoretic notions used for a word would mean we are talking about 
it as a string whereas model-theoretic notions used for it would mean we are referring 
to it as a structure). 

Theorem 8 Over words, a MSO sentence <fi is in WSC(B) iff it is equivalent to ip where 
ip = 3 B x\/ k y(f)\xy for some k G N. 

Proof sketch: We use the fact that over words, by the Buchi-Elgot-Trakhtenbrot theo- 
rem [4], MSO sentences define regular languages. The 'If direction is easy. For the 
'Only if direction, let the regular language L defined by be recognized by an n state 
automaton, say M.. If there is no word of length > N = (B + 1) x n in L, then L is 
a finite language of finite words and hence from Lemma 7, we are done. Else suppose 
there is a word of length > N in L. Then consider tp above for fc = iV. It is easy to ob- 
serve that (f> implies tp. In the other direction, suppose w \= tp for some word w. Then 
there exists a set A of elements i\, . . . ,i m s.t. (i) m < B and ix < i% ■ ■ ■ < i m and (ii) 
every substructure of w of size atmost N + m containing A models </>. From Lemma 
10 below, there exists a substructure Wi of w containing A such that (i) \wi\ < N and 
(ii) w\ G L iff w G L. Then wi models (f> and hence w (= (p. Thus ip implies 4> and 
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hence is equivalent to <fi. ■ 

Before going into the proof of the lemma, we briefly explain the intuition. Let qj be 
the state reached by automaton M upon reading the subword w[l . . . ij}. The subword 
w[(ij + 1), . . . ij+i] takes M. from qj to qj+i through a sequence S of states. Since 
M. has only n states, if w[(ij + 1), . . . ij+i] is long, then S will contain at least one 
loop. Then getting rid of the subwords that give rise to loops, we will be able to obtain 
a subword of w[(ij + 1), . . . ij+i] that takes Ai from qj to qj + i without causing Ai to 
loop in between. It follows that this subword must be of length at most n. Collecting 
such subwords of w[(ij + 1), . . . ij+i] for each j and concatenating them, we get a 
subword of w of length at most N containing set A that takes Ai from the initial state 
to the same state as w. We now formalize this intuition. 

Lemma 10 Let L be a regular language having an n state automaton accepting it. 
Given a natural number B, consider a wordw G E* of length > N = (B + 1) x n. Let 
A = {ii, . . . , i m } where i\ < ii . . . < i m be a given set of elements from the universe 
of w. Then there is a substructure W\ of w containing A such that (i) \w±\ < N and 
(ii) w\ <E L iff w G L. 

Proof: 

Let M = (Q, E, S, qo, F) be a DFA accepting L where Q = {qo, . . . , q n -i} is the set 
of states, E is the alphabet, 5 is the transition function, qo is the initial state and F is 
the set of final states. We use the following notation: If z is a sequence of objects, then 
we use z(k) to denote the k th element of z and z [k . . . I] to denote the subsequence of 
z formed by the k th , (k + l) th , . . .I th elements of z for k, I s.t. 1 < k < I < (length 
ofz). 

Let q(i + 1), 1 < i < \w\ be the state of Q after reading the word w [1 . . A], We take 
q(l) to be go- Then let q = (5(i))i<i<(| u ,|+i) be the sequence of these states. We are 
given A = . . . , i m } which is a subset of m elements of the universe of w. Let 
io = 1 and i m +i = \w\ + 1- For j 6 {0, . . . , m}, consider q [ij . . . ij+i]- Set p = ij 
to s = ij + i — 1. We collect a set T of indices between p and s using the procedure 
below: 

Initialize i to p. 

1. If i > s, then stop. 

2. If i = s, then put i into T and increment i by 1. 

3. If i < s, then let k s.t. p < k < s be the highest index such that q(i) = q(k). 
Then put k into T and update the value of i to be k + 1. 

At the end of this procedure, let the indices in T be k\ , . . . , ki where k\ < &2 < • • • < 
ki if T is non-empty. Note that T is empty iff ij = ij + i only if j = 0. Also note that 
at termination, the value of i must be s + 1. Finally note that q(ij),q(ki + 1), q(k2 + 
1), . . . , q(ki) must all be distinct so that I < n. 
Then consider the subword Wj of w given by 
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( 



e if T is empty 

w(ki) ■ w(k2) ■ ■ ■ w(ki) if T is non-empty 



Observe that \wj\ < n. Let r%, . . . , ri be the states the automaton M goes through 
when Wj is applied to state q(ij). 
We consider the following cases: 

1 . T is non-empty. 

Now from the way ki was chosen by the above procedure, q(ij) — q(k\). Then 
if M is in state q(ij), on w{k\), it moves to state n given by r\ = q(ki + 1). 
Similarly, the index k^ is s.t. q(k-i) = q(ki + l) so that if M is in state n, then on 
w{k%), it moves to state r-i given by r2 = q(k2 + 1). Continuing this way we find 
that on w(ki), if M is in state rj_i, it moves to state n given by r; = q(ki + 1). 
Now as observed above, at termination, the value of i must be s + 1 = ij+i . This 
can happen in only two ways: (a) In the previous iteration of the procedure, step 
(2) was executed in which case s was put in T - then ki = s. (b) In the previous 
iteration of the procedure, step (3) was executed in which case s again was put 
into T so that fc; = s. Then in either case ki = s = ij+i — 1 so that r; = 

Thus we see that both Wj and w [ij . . . (ij+i — 1)], when applied to M in state 
q(ij), take M to the same state, namely q(ij +1 ). 

2. T is empty. 

Then Wj = e and ij = in which case w [ij . . . {ij+i — 1)] = e so that 
both these words applied to M in state q(ij), take M to the same state, namely 

Then consider the word wi = Wo ■ w\ ■ ■ ■ w m . From the above observations, it follows 
that w\ applied to the initial state of M takes M to the same state as w. Then w\ G L 
iff w G L. Further since for each j, \wj\ < n, we have that \w\\ < (m + 1) x n < 



Returning to Theorem 8, observe that for the special case of B = 0, we obtain Los- 
Tarski theorem for words and also give a bound for the number of Vs in the equivalent 
II" sentence in terms of the number of states of the automaton for <fi. We have not 
encountered this result in our literature survey. 

Before proceeding ahead, as a slight diversion, we give a simpler proof of Los-Tarski 
theorem over words. In fact, over words, we have the following stronger result. 

Lemma 11 Consider any set S of words which is closed under taking substructures. 
Then S can be defined by a Tx{ sentence. 

Proof : Consider S = E* \ S - the complement of S. Since S is closed under taking 
substructures, S is closed under taking extensions. Then consider the set T of minimal 
words of S, i.e. words of S for which no subword is contained in S. We show that 
T must be finite. Suppose T were infinite. If we arrange the words of T to form 
a sequence - which is infinite - then by Higman's lemma, there is some word in the 



(B + 1) X n = N. 



14 



sequence which is a subword of another in the sequence. That means some word of 
T is a subword of another word in T. But that contradicts the minimality of the latter 
word. 

Then T is finite. Taking the disjunction of the existential closures of the diagrams of 
the words of T, we get a E° sentence defining S. Then taking the negation of this 
sentence, we get the desired ITj sentence defining S. M 

Thus contrary to the general setting where it is not necessary for a set of structures 
preserved under substructures to be even FO-expressible, leave alone being definable 
by a sentence, over words, II" sentences show much greater power. 

We return to Theorem 4 now. So far, relativization has worked in all the cases we have 
seen. We now give an example of a class of structures over which relativization fails, 
yet Theorem 4 is true. 

Consider a subclass C of bounded degree graphs in which each graph is a collection 
(finite or infinite) of oriented paths (finite or infinite). For clarity, by oriented path we 
mean a graph isomorphic to a connected induced subgraph of the graph (V, E) where 
V = Z and E — {(i, i + 1) | i G Z}. Observe that C can be axiomatized by a theory T 
which asserts that every node has in-degree atmost 1 and out-degree atmost 1 and that 
there is no directed cycle of length k for each k > 0. We first show the following. 

Lemma 12 For each B > 1, there is a sentence <fi G PSC (B) which is not equivalent, 
over C, to any ip of the form 3 B x V"y 4>\xy ■ 

Proof: Consider <j> which asserts that there are atleast B elements of total degree atmost 
1 where total degree is the sum of in-degree and out-degree. Clearly <f> G PSC(B) 
since it is expressible as a 3 B V* sentence. Suppose <j> is equivalent to ip of the form 
above for some n G N. Consider M G C which is a both-ways infinite path so that 
every node in M has total degree 2 - then M ^ 4>- Consider B distinct points on 
this path at a distance of atleast 2n from each other and form a B— tuple say a with 
them. Let b be any n— tuple from M. Now observe that M(ab) is a finite structure 
which has atleast B distinct paths (0-sized paths included). Then M(ab) \= <\> so that 
(M, a, b) \= 4>\xy- Since b was arbitrary, M \= ip so that M \= <fi. Contradiction. ■ 

However Theorem 4 holds over C ! 

Theorem 9 Over the class C of graphs defined above, <fi G PSC(B) iff (f> is equivalent 
to a3 B V* sentence. 

Proof: If r = {E} is the vocabulary of <f>, let tb be a vocabulary obtained by addding 
B fresh constants c\ , . . . , Cb to r. Given a class S of t— structures, define Sb to be the 
class of all tb~ structures s.t. the r— reduct of each structure in Sb is in S. Then the 
proof can be divided into two main steps. Below = denotes elementary equivalence. 
Step 1 : Given (f>, define class C'CC such that for every structure A G Cb, there exists 
structure D G C' B such that A = D (Property I). Since compactness theorem holds 
over Cb (as Cb is defined by the same theory T as C), it also holds over C' B - 
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Step 2 : Show that <j> is equivalent to an 3 B \/* sentence over C, hence showing the 
same over C as well. 

Note: The conditions in Step 1 imply that for every A G C, there exists D G C such 
that A = D. Then since compactness theorem holds over C, it also holds over C. 
Suppose the rank of <fi is m. We define C to be the set of graphs G E C such that either 
(a) there exists a bound ng (dependent on G) such that all paths in G have length less 
than uq (this does not mean that G is finite - there could be infinite paths of the same 
length in G) or (b) there are atleast (B + m + 2) paths in G which are infinite in both 
directions. It can be shown that C satisfies Property I (See A below). We proceed 
assuming this to be true. 

Now, to show Step 2, we use the following approach. 

Let P 6 C be s.t. P \= (f>. Choose a core Z in P (recall that (f> G PSC(B)). Let 
Alp G C' B be a t^— structure whose t— reduct is P and in which each element of Z 
is assigned to some constant. Let r Mp be the set of all V* sentences true in Mp. We 
can show that if M' G C' B is such that M' \= T Mp , then M' \= <j> (See B below. 
We proceed assuming this to be true). That is, if every finite substructure of M' is 
embeddable in Mp, then M' \= <fi. Then over C' B , T Mp — > cf>. Now, since C' B satisfies 
compactness theorem, there exists a finite subset T Q F of r A/p such that T^ Ip — > if> 
over C' B . Note that, since Fq /p is a conjunction of V* sentences, we can assume that 
Tq If is a single V* sentence. 

Let <j)p be the r— sentence of the form 3 B V* obtained by replacing the B constants in 
T^ Ip by B fresh variables and existentially quantifying these variables. Then check 
that cf>p — > 4>. It is easy to see that <fi —> Vpec' p\=4> § p P H 4>> then interpret the 
3 quantifiers in <fip as the chosen core Z mentioned above). By compactness theorem 
over C, there exists a finite set of structures, say {Pi, • • • , P m } such that Pi 6 C, 
Pi \= (f> and (f> — > Vi=i* ^Pi ■ Then, we have <fr <-> Vi=cT § p i over ^ • Since each <f>p i 
is of the form 3 S V*, Vi=cT 0-P< ls a ^ so °^ ^e same form. That completes Step 2 and 
completes the proof. 

Below we shall be referring to the notions of 'ball type of radius r' (or simply r— ball 
type), 'disjoint unions' (denoted by U) and 'm-equivalence' (denoted by = m ). We 
shall also use Hanf 's theorem. The reader is referred to [8] for these concepts. 

A. C satisfies Property I 

Suppose A € Cp- If there exists a bound tia, such that all paths in A have length less 
than tia, then A € C B and hence we are done. Contrarily, suppose that there is no such 
bound n>A- This means that either there are paths of arbitrarily large lengths in A or 
there is atleast one infinite path in A (Let us mark this inference as [*]). Now, construct 
structure D G C' B , where D = A U \Ji=" l+2 P ' where p 

is a path which is infinite in 

both directions and U denotes disjoint union. We show that A = D, by showing that 
for every n G N, A = n D. By Hanf's theorem [8], given n, there exist numbers r 
and q, dependent only on n, such that A = n D if for each ball type £ of radius r, the 
number of instances of £ in A and D are either equal or are both are greater than q. By 
adding (B + m + 2) paths, we are introducing infinite copies of just one i — ball type 
£ in D, namely the 2r + 1 length path with the ball center as the midpoint. However, 
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this type £ was already present infinitely many times in A (due to [*]). Hence Hanf 's 
condition holds for every type £, and thus, A = D. 

B. If Mi 6 C' B is such that Mi |= T Mp , then M 1 \=(f> 

Before, we proceed, we state and prove the following lemma. Below, an 'outwardly' 
(resp. 'inwardly') infinite path is an oriented infinite path with an end point which has 
an outgoing (resp. incoming) edge and no incoming (resp. outgoing) edge. 

Lemma 13 For every m € N and structure G E C, there exists a substructure G m C 
G, such that G rn = m G and G m has 

— atmost finitely many finite paths 

— atmost in paths which are outwardly-infinite 

— atmost m paths which are inwardly-infinite 

— atmost 1 path which is bidirectionally -infinite 

Proof: By Hanf's Theorem, there exists t m G N, such that any two paths of length 
greater than t m are m-equivalent. For any graph GeC, define the following, 

— for jgN, let af be the number of i length paths 

— af be the number of outwardly-infinite paths 

— af be the number of inwardly-infinite paths 

— be the number of bidirectionally-infinite paths 
Given G, consider G m C G given as, 

— for i e {0, • • • , t m }, af = min(af, m) 

oo 

- a tT+i =min ( E af,m) 
i=i m +l 

— fori > 0™ + 1), af m = 

— aS = min(a!jf, m) 

— a^ m = min(a9 , m) 

— m = min(a^ , 1) 

By Hanf's theorem, it is easy to see that G m C G and G m = m G. ■ 

Suppose Mi € C' B is such that Mi |= T Mp . To show that Mi |= 0, we show that there 
exists a substructure M2 of Mp such that Mi = m M2 (recall that P is a model of <f> 
and M p is the expansion of P with the elements of a chosen core Z as interpretations 
of the B constants). Since <fi £ PSC(B), P \= <f>, and any substructure of Mp would 
contain the core Z of P, we have that M2 |= 0. And since Mi = m M2, we would 
have Mi |= 4>. 

Consider the partition of Mp into two parts Mp i and Mp t 2, where Mp i is substruc- 
ture containing all those paths in Mp which contain the interpretation of atleast one 
of the constants ci, • • • ,cp and Mp t % contains all the paths in Mp which are not in 
Mp t \. Similarly, consider the partition of Mi into M^i and Mi 2- There are two cases 
to consider. 

Case 1 : There exists a bound np such that all paths in P (and Mp) have length less 
than n p 
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Note that since Mi |= T AIp , for every finite substructure of Mi, there exists an iso- 
morphic substructure of Mp. And since all paths in Mp have length less than np, 
we have that all paths in Mi have length less than n p as well. Consider the substruc- 
ture Mf = M M U M™ 2 C Afx (where M™ 2 is as defined in Lemma 13). Clearly, 
Mi = m M\. Moreover, since both M\ % i and M{ n 2 are finite, Mf is finite, hence there 
exists a substructure M 2 C Mp, such that Mf and M 2 are isomorphic. And since 
M<i \= <t> (see above for the reasoning), we have Mf |= <f> and hence Mi |= cj) (since 
Mf = m Mi). 

Case 2 : There are atleast (B+m+2) paths in Mp which are infinite in both directions. 
Consider a path L in Mi containing the interpretation a, of a constant c,. Since Mi |= 
r Mp , one can see that L must be a subpath of some path in Mp - infact subpath of some 
path in Mp^. Thus, arguing similarly for each path L C Mi i, we have Mi.i C Mp^. 
Also, since there are (B + m + 2) bidirectional-infinite paths in Mp, atleast (m + 2) 
of these would be present in Mp^. Now, since M{™ 2 C M12 (as defined in Lemma 
13) contains, 

- finitely many finite paths - all of these can be embedded in a single bidirectional 
infinite path 

- atmost m outwardly-infinite and atmost m inwardly-infinite paths - all of these 
can be embedded in m bidirectional-infinite paths 

- atmost 1 bidirectional-infinite path : can be embedded in a single bidirectional- 
infinite path. 

it follows that M^ 2 can be embedded into M P>2 . Thus, Mf = M M U M™ 2 C 
M Pt i U Mp-2 = M P . Hence Mf |= 0. And since M{™ 2 = m Mi j2 , we have 
Mf = m Mi, and hence Mi |= 0. 

Thus, we have shown that if Mi £ C' B and Mi |= T A/p , then Mi \= <j>. ■ 
We now look at some classes of structures over which Theorem 4 fails. 

5 Theorem 4 fails over special classes of structures 

We first look at the class T of all finite structures. Los-Tarski theorem fails over this 
class and hence so does Theorem 4 (for B = 0). However, we have the following 
stronger result. We prove it for relational vocabularies (constants permitted). 

Lemma 14 For relational vocabularies, Theorem 4 fails, over J-,for each B > 0. 

Proof: We refer to [1] for the counterexample \ f° r Los-Tarski in the finite. Let r be 
the vocabulary of \ (i- e - {<, S, a, b}) along with a unary predicate U. Let us call an 
element x as having colour in a structure if U (x) is true in the structure and having 
colour 1 otherwise. Let ip be a sentence asserting that there are exactly B elements hav- 
ing colour and these are different from a and b. Then consider <j> = ~~<X A <p. Check 
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that since ->x is preserved under substructures in the finite, in any model of 0, the B 
elements of colour form a core of the model w.r.t. </>. Then <fi <G PSC(B). Suppose 
4> is equivalent to ip given by 3 B xM n y f3 where (3 is quantifier-free. Observe that in 
any model of <fi and ip, any witness for ip must include all the B elements of colour 
(else the substructure formed by the witness would not model Lp and hence <p, though it 
would model ip). Consider the structure M = ({0, 1, . . . , B + 2n + 3}, <,S,a, b, U) 
where < is the usual linear order on numbers, S is the (full) successor relation of <, 
a = 0,b = B + 2n + 3 and U = {1, . . . , B}. Now M ^ (p since M ^ 
Consider Mi which is identical to M except that S(B + n + l,y) is false in Mi for 
all y. Then Mi |= <p so that Mi |= ip. Any witness a for ip must include all the 
B colour elements of Mi. Then choose exactly the same value, namely a, from 
M to assign to x. Choose any b as y from M. Check that it is possible to choose d 
as y from Mi s.t. M(ab) is isomorphic to Mi(ad) under the isomorphism / given 
by /(0) = 0J{B_+ 2n + 3) = B + 2n + 3, /(a,) = a t and /(&<) = d t where 
a = (ai, . . . , as), b = (pi, . . . , b n ) and d = (di, . . . , d n ). Then since Mi |= /3(a, d), 
M |= (3(a,b). Then M models tjj, and hence cf>. But that is a contradiction. ■ 

The example expressed by x can a l so be written as a sentence in a purely relational 
vocabulary. The sentence <fi below is over the vocabulary r = {<, S, U}. We leave 
it to the reader to reason out (in the same manner as in [1]) that <fi is preserved under 
substructures in the finite but is not equivalent to any universal sentence. 

<t> = Xi A x 2 A X3 where 
Xi = VxVyVz ((x < x) A ((x < y) V (y < x)) A 

((x < y) A (y < z)) -> (x < z)) 
X2 = VxVy S(x,y) ->■ Vz(((x < z) A (x ^ z)) ->• < zj) 

X3 = 3z\/x!Vx 2 (/\ l ^l^U(x t ) A (x! ^ x 2 )) ^ (X4,(xi,x 2 ,z)\/ X4{x2,xi,z)) 
Xa(xi,x 2 ,z) = Vy ((xi < y) A (y < x 2 )/\ 

((y^x 1 )A(y^x 2 ))^U(y)A 

((z^x 2 )A^S{z,y))) 

Then one can do a similar proof as above to show that for purely relational vocabularies 
too, for each B > 0, Theorem 4 fails over T. 

So far, in all the cases we have seen, it has always been the case that Theorem 4 and 
Los-Tarski theorem either are both true or are both false. We then finally have the 
following result which is our first instance of a class of structures over which Los- 
Tarski theorem holds but Theorem 4 fails. 

Theorem 10 Over the class C of graphs in which each graph is a finite collection of 
finite undirected paths, for each B > 2, there is a sentence <j) £ PSC(B) which is not 
equivalent to any 3 B V* sentence. However, Los-Tarski theorem holds over C. 

Proof: Los-Tarski theorem holds from the results of Dawar et al. over bounded degree 
structures [2]. As a counterexample to Theorem 4 for B > 2, consider condition Di, 
parametrized by B, which asserts that there are atleast B paths (0 length included) in 
the graph. We show that this is FO definable because the following equivalent condition 
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D 2 , parametrized by B, is FO definable: (The number of nodes of degree 0) + \ x (the 
number of nodes of degree 1) > B. We briefly explain this equivalence between D\ 
and D 2 . Consider a graph satisfying D\. Let k be the number of 0-length paths so that 
there are atleast B — k paths of length > 1. Each of the latter paths has exactly 2 nodes 
of degree 1. Then it is easy to check that condition D 2 holds. Conversely, suppose 
a graph satisfies D 2 , but it has less than B paths. Let k be the number of 0-length 
paths so that there are atmost B — 1 — k paths of length > 1. Each of the latter paths 
has exactly 2 nodes of degree 1 . Then, (the number of nodes of degree 0) + \ x (the 
number of nodes of degree 1) < (fc + | x2x (B — 1 — k)) < B - contradicting D\. 
Then D 2 implies D\. 

Then, given B, D\ is expressible by a FO sentence <f> since D 2 is FO expressible (the 
latter is easy to see). 

To see that <fi is in PSC(B), in any model, observe that the set of nodes formed by 
picking up one end point each of B distinct paths is a core. 

Now suppose that <f> is equivalent over C to ip = 3 B x \/ n y /3(x, y) for some n G N 
where j3 is quantifier-free. Consider a graph M which has exactly [-j] paths, each 
of length > 5n (There is nothing sacrosanct about the number 5 - it is just sufficiently 
large for our purposes). By definition, M Y= <t> and hence M Y= 4>- Label the end points 
of these paths as pi,p2,P3, ■ ■ ■ ,P2k where k = Hjl . Now consider a graph N having 
exactly B paths, each of length > 5n . By definition, N \= <fi and hence N \= tp. 
Then there exists a witness a = (ai, . . . , as) in N for ip. Observe that no two of the 
diS can be in the same path else taking the substructure of N formed by just the paths 
containing a, one would get a model of ip and hence <fr - but the number of paths in this 
model would be < B — 1, giving a contradiction. We now choose points b\, . . . , bs 
in M as follows. For i € {1, . . . , B}, if «j is at a distance of atmost n from any end 
point in N, then choose bi to be at the same distance from pi in M. Else choose 6j 
to be at a distance of n from Pi in M. Assigning b = (b±, . . . , 6s) as x, choose any 
d as y from M. Check that it is possible to choose e as y from N s.t. M (bd) is iso- 
morphic to N(ae) under the isomorphism / given by /(&,) = ai, f(dj) = Cj where 
d= (di,...,d„) ande = (e 1; ...,e„). Since N |= /?(a,e), M \= /3(b,d). Then AI 
models ip — & contradiction. ■ 

Important Note: For B = 2, the sentence <f> above is equivalent to asserting that either 
(i) there are atleast 2 nodes of degree exactly or (ii) there are atleast 3 nodes of degree 
atmost 1. Consider the following condition for B > 2 whose special case for B = 2 
is the condition just mentioned: Either (i) there are atleast B nodes of degree exactly 
or (ii) there are atleast B + 1 nodes of degree atmost 1 . This condition, for a given B, 
is easily seen to be expressible as a FO sentence £ (in fact, £ is of the form 3 B+1 V*). 
But for B > 2, £ ^ PSC(B). To see this, consider a graph M containing exactly 2 
paths Pi and P 2 of length > 1 and B — 3 paths of length (the total number of paths 
is then < B). We will show that M has no core (w.r.t. £) of size atmost B. Firstly, 
M h £ since M has B + 1 nodes of degree atmost 1. If £ G PSC(B), then M has 
a core C of size atmost B. There are 2 cases: (a) One of Pi or P 2 has atmost 1 core 
element, (b) Both Pi and P 2 have atleast 2 core elements. In case of (b), note that 
atleast one of the 0-length paths will not contain any core element. Then consider the 
substructure Mi of M without this path - this contains all core elements and hence 
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must satisfy £. However, there are exactly B elements of degree atmost 1 in Mi and 
hence Mi violates £. In case of (a), there are two subcases: (i) One of Pi or P%, say 
Pi w.l.o.g., contains no core element. Then the substructure Mi of M which is all 
of M, but excluding Pi, contains all core elements and must hence model £. But Mi 
contains exactly B — 1 nodes of degree atmost 1; so it violates £. (ii) One of Pi or P^, 
say Pi w.l.o.g., contains exactly 1 core element say a. Let Mi be the substructure of 
M without Pi. Consider the disjoint union M3 of Mi and the substructure M2 of M 
induced by a. Then M3 C M contains all core elements and must hence model £. But 
M3 contains exactly B nodes of degree atmost 1 ; so it violates £. 
In all cases, we have a contradiction. Hence M has no core of size < B. Hence 
£ i PSC(B). 

Interestingly however, Theorem 4 holds over C for B = 1 as we shall see in the next 
Lemma. We also give a simpler proof for the case of B — i.e. Los-Tarski over C. 

Lemma 15 Over C, for B < 1, <f> G PSC(B) iff <p is equivalent to i\) where -0 = 
3 B x \/ n y <f>\xy for some n G N. 

Proof: Let the quantifier rank of <fi be m. By Hanf's theorem, we have the following: 

A There exists a number t m G N such that any two undirected paths of length greater 
than t m are m— equivalent. 

B There exists a number s m G N such that given a structure G = (P, a) where P e C 
is (finite) path of length greater than s m and a is a designated element of P, there 
is a substructure Gi = (Pi, a) of G s.t. (i) Pi is a subpath of P containing the 
designated element a, (ii) \Pi \ < s m and (iii) G = m G\. 

C For any graph G G C, let af be the number of undirected paths of length i in G. 
Now, given graph G G C, we consider a graph G m C Gas follows (similar to the 
method in the proof of Theorem 9): 

- for i G {0, • • • , t m }, af m = min(af , m) 

00 

- a^ m +1 = min( £ a f > m ) 

i=i m +i 

- for i > (t m + 1), af ™ = 

By Hanf's theorem, it is easy to verify that G m = m G. 
Now consider the statement of the (current) lemma for B = 1. Let n = s m + 

i=t m + l 

(m ■ (i + 1)) and consider ip given by ip = 3x \/ n y 4>\ X y. That <j) ip fol- 

i=0 

lows from Lemma 5. For the converse, suppose G \= ip. Let a be a witness and let P 
be the path in G on which a appears. Consider the vocabulary Ti = {E} U {ci} where 
c x is a fresh constant and consider Q = (G, a) - the r x -structure obtained by expanding 
G with a as the interpretation for c\. Let Q = Qi U G2 where Q\ = (P, a) and G2 G C 
is the collection of all paths in G other than P. Note that we have abused the U notation 
slightly but the idea of separating P and a from the rest of G is clear. Now, 
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— Let Q[ C Q 1 be the structure ensured by [B] above. Then (i) \Q[\ < s m and (ii) 
G[ =m Gi- 

i=t m +i 

-Let Go™ be as given by [C] above. Then(i)G^ C G 2 , (ii) |G" l | < £ m-(i + l) 
and (hi) G 2 " = m G 2 . 

Then ^ = (G[ U G 2 ") = m (Si UG 2 ) =G- Also 0' C 0. Note that \Q'\ < s m + 
»=t m +i 

m • (i + 1) = n. Now since G |= choose x = a and y = d where d is any 

i=o 

tuple containing exactly the elements of G' - this is possible since \G'\ < fias we just 
saw. Then (G, a, d) |= (f>\xy so that £7' |= </>. Then G \= 4> an d hence G |= </>. 
For i? = 0, there is no Gi and hence no Q[. It is easy to see that the same proof goes 
through. ■ 

6 Additional observations on Los-Tarski theorem over 
the class of all finite structures 

We will refer to truth or failure of Los-Tarski over the class of all finite structures 
simply as the truth or failure of Los-Tarski 'in the finite'. 

Now as observed earlier in Sections 4 and 5, while Los-Tarski fails in the finite, there 
are special fragments of FO for which Los-Tarski is true in the finite. We present 
below two additional fragments of FO for which Los-Tarski is true in the finite. This 
would follow from their combinatorial proofs and hence we state the results below for 
arbitrary structures. 

Lemma 16 Consider <fi of the form 3xWyip(x,y) in a purely relational vocabulary 
t. If 4> £ PS, then (f> is equivalent to ip = Vzi . . . yz n (f>\t Zlt ... tZn \ where n = 2' r L 
Further, this bound is tight i.e. there is a 3V sentence in PS which is not equivalent to 
a universal sentence with less than n quantifiers. 

Proof: 

From Lemma 5, it follows that if M \= <j> then M \= Lp. Therefore to prove the lemma, 
it suffices to show that if M \= ip, that is, every substructure of M with size atmost 
n is a model of <f>, then infact M \= <fi. We prove it by contradiction, so assume that 
M (= (p A The main idea is to use M to come up with a structure which models <j>, 
but which has a substructure which is a non-model of <fi. This contradicts that <j> e PS. 
[Note that \M\ > n for such an M, since if \M\ < n and M \= ip then M \= (j) as 
well.] 

Since every substructure of M with size atmost n models <j>, every 1 sized substructure 
of M is a model of <\>, and hence -(/;(£; x) is true for every x G M (recall that (f> = 
3x\/ytp{x, y)). Now note that n = 2' T I is the number of all 1-types possible over the 
vocabulary t upto equivalence (An i-type of r is a quantifier-free formula over t which 
uses just i variables. The number of i types is finite upto equivalence. See [8] where 
our «-type is called rank-0, i-type). Denote the 1-types as {cto, • • • , a n -i}, and <Ji(x) 
denotes that x is of 1-type Oi. Suppose that there exists an element xq of 1-type Gi in 
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M. Since M \= \/x3y^tp(x,y), there exists a yo such that il>(xo,yo) is false in M. 
However, since every substructure of size atmost n is a model of <fr, the substructure 
M({xq, yo}) h= 4> an d hence ip(yo, xq) must be true in M (since either Xq or y must 
act as a witness for x in 0. But ijj(xo,yo) is false. Hence xo cannot be the witness). 
Let yo be of 1-type a^. 

Suppose that it is possible to have a structure A with just two elements {ao, ai} such 
that (Ti(ao), Ci(ai) and -^(ao, ai) hold. Then consider the structure X with universe 
{a , ai, 02, &} such that (i) ai(aj) holds for j G {0, 1, 2} (ii) <7k{b) holds 
(hi) -iip(aj, a (j+i)mod 3) holds for j G {0, 1, 2} (iv) tp(b, dj) holds for j G {0, 1, 2} and 
(v) ip(b, b). Such a structure exists because all the 1 -types and 2-types have been copied 
from other structures, namely, (i), (iii) are copied from A and (ii), (iv), (v) copied from 
M. Clearly, X |= <fi, since b <E X acts as a witness for x in </>. However, the substructure 
of X induced by {ao, ai, 02} Y= (j). This contradicts the given assumption of <j> € PS. 
Hence, it is not possible to have a structure A as assumed, and hence taking a structure 
A' with two elements ao, ai such that o"j(ao), <Jj(ai) hold, necessitates that ip(ao,ai) 
must hold (Note that for every 1-type o~i in M, one can construct such an A^). 
Consider M' to be a substructure of M which contains exactly one element of each 1- 
type present in M. Clearly \M'\ < n and hence M' (= <f>. Thus, there exists x\ G A/' 
such that for every y\ G M', ijj( x i, Vi) holds. Suppose that ai(x\) holds. Construct 
an extension M of M with an additional element zo such that (i) 07 (zo) holds (ii) 
\/y G M tp(zo, y) holds (iii) ip(zo, Zo) holds. Such a structure M exists because all the 

1 - types and 2-types have been copied from other structures, namely (i), (iii) are copied 
from M, (ii) is copied from M for y satisfying -*<Ti(y), and for y satisfying cri{y), the 

2- type is copied from A',. Clearly, M \= 4> as zo G M acts a witness for x in 0. How- 
ever, M C M and M ^ 0. This again contradicts that tfi G PS 1 . Hence, our original 
assumption that there exists AI such that M \= <p A -1^ is incorrect. Then ip —> <fr. 

To prove the optimality of the bound, consider the following example over a vocab- 
ulary of k unary predicates. We construct a formula <fi such that the smallest n for 
which 4> o Vzi • • • Vzn0|/ Zl) .. 2n j is infact n = 2 k . Suppose for contradiction 
that (j> O Vzi . . .\/z n ip. Then by Lemma 4, -f> Vzi • ■ ■ Vz n _i(/)|{ 2li ... j2?i _ 1 }.. Let 
{do, • • • , er„_i} be the set of all 1-types. 

Tl-l 

Define <fi = 3x\/y f\ (cri(x) — > -i(T( i+1 ) mo( j „(?/))■ It is easy to check that the se- 

i=0 

mantic interpretation of <fs implies that M (= <j> if and only if there exists atleast one 
1-type <Tj which is not present in M. Now consider the structure M which has ex- 
actly one copy of each 1-type <Tj. Clearly, in every substructure of M which has size 
less than or equal to n — 1, there exists atleast one 1-type which is missing. Hence 
M \= Vzi ■ ■ ■ yz n -icf)\s Zl! ... iZn _ 1 }. However, M ^ 4> as all 1-types are present in M. 
This is a contradiction. Hence (f> Vzi • • • Vz n _i</>|{ 2li ... )2n _ 1 }, and thus, the bound 
n = 2' T ' is optimal. ■ 

Lemma 17 Lef t be a purely relational vocabulary and (f> be a sentence in FO(t) s.t. 
(i) 4> = 3xi . . . 3xkVyil>(xi, . . . , Xk, y) where ip is quantifier free and no 3 variable 
is compared with a V variable using equality ( ii) <fi G PS. Then </> is equivalent to 
f = Vzi . . . V2n<^|{zi,..., 2 »} where n is 2' T L 
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Proof: 

From Lemma 5, we have <j> — > <P- Therefore to prove the lemma, it suffices to show 
that if M \= (p, that is, every substructure of M with size atmost n is a model of <fi, 
then infact M \= <j>. We prove it by contradiction, so assume that M \= ip A -i<j>. The 
main idea is to use M to come up with a structure which models 0, but which has a 
substructure which is a non-model of <p. This contradicts that <f> G PS. [Note that 
\M\ > n, since if \M\ < n and M |= ip then M (= <f> as well.] 

Consider M' to be a substructure of M which contains exactly one element of each 
1-type present in M. Clearly \M'\ < n and hence M' |= <f>. Thus, there exists 
ax, ■ ■ ■ , a-k € M' such that for every b G M', ip(ax, ■ ■ • , a*, 6) holds. Construct an ex- 
tension M of M with fc additional elements {zx, ■ ■ ■ , Zk] such that (i) M({zi, ■ ■ ■ , Zk}) 
is isomorphic to M'({ai, ■ ■ ■ , at}) via the isomorphism f{z{) = at 
(ii) My £ M i/)(zx, ■ ■ ■ , Zk,y) holds. Such a structure M exists because all r-types 
(r < k + 1) have been obtained by copying predicate values from other structures as 
now explained. The types in (i) are copied from M'. The types in (ii) are copied as 
they are from M' as follows: suppose yo G M' has the same 1-type as y £ M, then 
r-type {zx, ■ • • , Zk,y} m M (where r is the number of distinct elements present in 
{zx, ■ ■ ■ , Zk,y}) is obtained by having all propositional statements, a(zi, ■ ■ ■ ,Zk,y) 
to have the same value in M as a(ax, ••• , dfe, yo) in M', where there is no equality 
between y and Zj in a. Then, ip(zx, ■ ■ ■ ,Zk, yo) is true in M, as ^(ai, • ■ ■ , ajt, yo) 
is true in M', Also, since there are no equality comparisons between Zj and y in ip, 
ip(zx, ■ ■ ■ ,Zk,y) has the same value as %p(ax, • ■ ■ , ak, yo), even if yo was infact one 
of the dj& itself. Thus, we have M \= <f> as Z\, ■ ■ ■ , Zk G M act as witnesses for 
Xx , • • • , Xk in (j>. However, M C M and M ^= This contradicts that <f> € P5. 
Hence, our original assumption that there exists M such that M \= p A ^<p is incorrect. 
Then ip — > (j>. ■ 

We now make the following important observation given our results. Over the class of 
all finite structures and for purely relational vocabularies, the following hold: 

1 . Los-Tarski holds trivially for the and fragments of FO. A S° sentence in 
PS is actually valid. There is nothing to do in the n" case. 

2. By Lemma 8, Los-Tarski holds for ifi!. 

3. The counterexample to Los-Tarski in the finite, given as a purely relational sen- 
tence </> after Lemma 14 in Section 5, is an 3V 4 sentence. Then Los-Tarski fails 
in the finite for and U° k for all k > 3. 

4. By Lemmas 16 and 17, for the 3V fragment (with equality) of and the 3*V 
fragment (with restricted equality) of £3, Los-Tarski holds. 

This then leaves open only the following cases to investigate for Los-Tarski in the finite 
for purely relational vocabularies. 

1. Full 3*V fragment (in particular, the 'with-equality' case) 

2. 3*V 2 
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3. 3*V 3 

4. 3V 4 without equality 

Any resolution of all these cases would give a complete characterization of the dividing 
line in the class of prefix fragments of FO, over purely relational vocabularies, between 
those prefix fragments for which Los-Tarski holds in the finite and those for which it 
does not! 

We are currently trying to see if Lemmas 16 and 17 go through for relational vocabu- 
laries too (constants permitted). If so, then observing that the counterexample x men- 
tioned in the proof of Lemma 14 is a 3V 3 sentence, the only cases left to investigate 
would be the above cases of (1) and (2) and finally the 3V 3 fragment without equality. 
With any resolution of these cases, we would get a complete characterization of the di- 
viding line in the class of prefix fragments of FO, over relational vocabularies, between 
those prefix fragments for which Los-Tarski holds in the finite and those for which it 
does not. 

7 Proof of Theorem 4 

We first introduce some notations. Given a vocabulary r, we denote by Tfc, the vo- 
cabulary obtained by expanding t with fc-fresh constants, say ci, . . . , Cfe. Given a r- 
structure M and k elements bi, . . . , bk from M, we denote by (M, bi, . . . , the 
Tfc -structure whose r-reduct is M and in which the constant Ci is interpreted as bi for 
1 < i < k. Finally, for a r-structure M, we denote by \M\, the power of M, i.e. the 
cardinality of the universe of M. 

We begin with the following definition. 

Definition 3 (k-cover) Given a r-structure M, we call a set K of t- structures as a 
fc-cover of M if (i) N C M for each N £ K ( ii) the union of the universes of the 
elements of K is the universe of M and (Hi) for every atmost k-sized subset S of the 
universe of M, there exists an element of K containing S. We call M as the union of 
K and denote M as (J K. 

Note that given M, there always exists a fc-cover of it - choose the set K above as 
{M}. 

Definition 4 (Preservation under k-covers) A FO(r)-sentence <f) is said to be pre- 
served under k-covers, if for all r-structures M and all k-covers K of M, if every 
structure in K satisfies <fi, then M satisfies (j). 

We will assume familiarity with the notion of saturations described in [5] and recall 
now the following theorems from [5] which we will use subsequently. 

Proposition 1 (A special case of Proposition 5.1.1(iii) in [5]) Given an infinite cardi- 
nal X and a X-saturated structure M, for every k-tuple (ax, ■ • ■ , ctfc) of elements from 
M where k 6 N, (M, a\, . . . ,ak) is also X-saturated. 
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Proposition 2 (Proposition 5.1.2(H) in [5]) M is finite iff M is X— saturated for all 
cardinals X. 

Theorem 11 (A special case of Lemma 5.1.4 in [5]) Let r be a finite vocabulary, A be 
an infinite cardinal and M be a T-structure such that tu < \M\ < 2 X . Then there is a 
fi-saturated elementary extension of M for f3 > A. 

Theorem 12 (Lemma 5.2.1 in [5]) Given T-structures M and N and a cardinal A, 
suppose that (i) M is X-saturated (ii) A > \N\ and (Hi) every existential sentence true 
in N is also true in M. Then N is embeddable in M. 

Putting Theorem 1 1 and Proposition 2 together we get the following. 

Corollary 3 For every T~structure M, there exists a /3-saturated elementary exten- 
sion of M for some cardinal (3 > u. 

Towards our syntactic characterization, we first prove the following. 

Lemma 18 Given a finite vocabulary r, consider a FO(t)- sentence (j) which is pre- 
served under k-covers and let T be the set of all V fe 3* consequences of '(f). Then for all 
infinite cardinals X, for every X-saturated structure M, if M |= T, then M \= <fi. 

Proof: 

If 4> is either unsatisfiable or valid, then the result is immediate. 

Else, consider M satisfying the assumptions above. To show that M \= <fi, it suf- 
fices to show that for every atmost fc-sized subset S of the universe of M, there is 
a substructure M s of M containing S such that M s models <fi. Then the set K = 
{M S \S is an atmost fc-sized subset of the universe of M} forms a fc-cover of M. Fur- 
ther since <f> is preserved under fc-covers, M |= <fi. 

Let <zi, . . . , Ofc be the elements of a subset S of the universe of M. To show the exis- 
tence of M s , it suffices to show that there exists a Tfc-structure N s.t. (i) N is of power 
atmost A (ii) every 3* sentence true in N is also true in (M, a\, . . . , a^) (ii) N \= 4>. 
Since M is A-saturated, by Proposition 1, (M, oi, . . . , flfe) is also A-saturated. Then 
from Theorem 12, N is embeddable into (M, ai, . . . , Ofc). Then the r-reduct of the 
copy of N in (M, oi, . . . , a*) can be taken to be M s referred to above. 
We now show the existence of N to complete the proof. 

Let P be the set of all V* sentences of FO(rk) which are true in (M, oi, . . . , a^). 
Consider the set T = {<fi} U P. Suppose T is unsatisfiable. Then by Compactness 
theorem, there is a finite subset of T which is unsatisfiable. Since P is closed un- 
der taking finite conjunctions and since each of P and <fi is satisfiable, there exists a 
sentence ip in P s.t. {(f>,ip} is unsatisfiable. Then <fi — > -iijj. Now is a FO(t) 
sentence while ip is a FO(rk) sentence. Then by V-introduction, <fi — > ip where 
ip = Vxi . . . Vxfc-i^[ci i— > X\\ . . . ; Cfe \-> Xk] where x\, . . . , Xk are fc fresh variables 
and Cj h-> Xi denotes replacement of q by Xj. Now note that since tp is a V* sentence, 
is a 3* sentence (in FOfa)) and hence ip is a V fe 3* sentence (in FO(t)). Then 
ip G r so that M |= ip. Then (M, oi, . . . , ttfc) |= This contradicts the fact that 

ip e P. 
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Then T is satisfiable. By Lowenheim-Skolem theorem, there is a model TV of T of 
power atmost A. Since TV models every V* sentence true in (M, oi, . . . , a,k), it follows 
that every 3* sentence true in TV is true in (M, a%, . . . , a^). Finally, since TV \= 0, TV 
is indeed as desired. ■ 

Theorem 13 Given a finite vocabulary r, a FO(r)-sentence is preserved under It- 
covers iff it is equivalent to a V fe 3* sentence. 

Proof: 

Let r be the set of all V fe 3* consequences of 0. It is easy to see that — > T. For the 
converse direction, suppose AI \= T. By Corollary 3, there is a j3— saturated elementary 
extension M + of AI for some /3 > u>. Then M + |= T. Then from Lemma 18, it follows 
that M + |= 0. Since AI + is elementarily equivalent to AI, we have that AI \= 0. 
Then T — > <j> and hence <fi <r> V. By Compactness theorem, <j> is equivalent to a finite 
conjunction of sentences of T. Since T is closed under finite conjunctions, <fi is equiva- 
lent to a V fc 3* sentence. ■ 



We now prove Theorem 4. 



Theorem 4 Given a finite vocabulary r, a FO(t) sentence is in PSC(B) iff it is 
equivalent to a 3 S V* sentence. 

Proof: We infer from Theorem 13 the following equivalences. 
4> is equivalent to a 3 B V* sentence iff 

is equivalent to a \/ B 3* sentence iff 
For all T-structures M and all B-covers K of M, if ViV e if, iV |= then M |= ^0 
iff 

For all r-structures M and all B-covers K of M, if M (= then 3N e K, N ^ 4> 

Assume <j> G PSC(B). Suppose K is a S-cover of M and that AI \= <f>. Since 
G PSC(B), there exists a core C of M of size atmost B. Then by definition of 
i?-cover, there exists N G K s.t. (i) N contains C and (ii) N C M. Then since C 
is a core of M, TV |= by definition of PSC(B). Then by the equivalences shown 
above, is equivalent to a 3 B V* sentence. It is easy to see that an 3 S V* sentence is in 
PSC(B). U 



8 Conclusion and Future Work 

For future work, we would like to investigate cases for which combinatorial proofs of 
Theorem 4 can be obtained. This would potentially improve our understanding of the 
conditions under which combinatorial proofs can be obtained for the Los-Tarski theo- 
rem as well. An important direction of future work is to investigate whether Theorem 4 
holds for important classes of finite structures for which the Los-Tarski theorem holds. 
Examples of such classes include those considered by Atserias et al. in [2]. We have 
also partially investigated how preservation theorems can be used to show FO inex- 
pressibility for many typical examples (see [12]). We would like to pursue this line of 
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work as well in future. 
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